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Abstract 

We study the fc-wise independent relaxation of the usual model Q {N, p) of random graphs where, 
as in this model, N labeled vertices are fixed and each edge is drawn with probability p, however, it is 
only required that the distribution of any subset of k edges is independent. This relaxation can be rel- 
evant in modeling phenomena where only fc-wise independence is assumed to hold, and is also useful 
when the relevant graphs are so huge that handling Q {N, p) graphs becomes infeasible, and cheaper 
random-looking distributions (such as fc-wise independent ones) must be used instead. Unfortunately, 
many well-known properties of random graphs mQ{N, p) are global, and it is thus not clear if they are 
guaranteed to hold in the fc-wise independent case. We explore the properties of fc-wise independent 
graphs by providing upper-bounds and lower-bounds on the amount of independence, fc, required 
for maintaining the main properties of Q{N,p) graphs: connectivity, Hamiltonicity, the connectivity- 
number, clique-number and chromatic-number and the appearance of fixed subgraphs. Most of these 
properties are shown to be captured by either constant fc or by some fc = poly {\og{N)) for a wide 
range of values of p, implying that random looking graphs on N vertices can be generated by a seed 
of size poly{log{N)). The proofs combine combinatorial, probabilistic and spectral techniques. 
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1 Introduction 



We study the fc-wise independent relaxation of the usual model Q{N,p) of random graphs where, as in 
this model, labeled vertices are fixed and each edge is drawn with probability (w.p., for short) p = 
p{N), however, it is only required that the distribution of any subset of k edges is independent (in 
Q{N,p) all edges are mutually independent). These fc-wise independent graphs are natural combina- 
torial objects that may prove to be useful in modeling scientific phenomena where only fc-wise inde- 
pendence is assumed to hold. Moreover, they can be used when the relevant graphs are so huge, that 
handling Q{N,p) graphs is infeasible, and cheaper random-looking distributions must be used instead. 
However, what happens when the application that uses these graphs (or the analysis conducted on them) 
critically relies on the fact that random graphs are, say, almost surely connected? After all, A:-wise in- 
dependence is defined via 'local' conditions, so isn't it possible that /c-wise independent graphs will fail 
to meet 'global' qualities like connectivity? This motivates studying which global attributes of random 
graphs are captured by their /c-wise independent counterparts. 

Before elaborating on properties of A;-wise independent graphs we provide some background on k- 
wise independence, on properties of random graphs, and on the emulation of huge random graphs. 

1.1 Emulating Huge Random Graphs 

Suppose that one wishes to conduct some simulations on random graphs. Utilizing Q{N,p) graphs 
requires resources polynomial in A^, which is infeasible when is huge (for example, exponential in 
the input length, n, of the relevant algorithms). A plausible solution is to replace Q{N,p) by a cheaper 
'random looking' distribution Qn. To this end, each graph G in the support of Qn is represented by a 
very short binary string (called seed) s{G), s.t. evaluating edge queries on G can be done efficiently when 
s{G) is known; Then, sampling a graph from ^at is done by picking the seed uniformly at random. 

Goldreich et al. were the first to address this scenario in ll23l . They studied emulation by computation- 
ally pseudorandom graphs, that are indistinguishable from Q{N,p) from the view of any poly{log{N))- 
time algorithm that inspects graphs via edge-queries of its choice. They considered several prominent 
properties of Q{N,p) graphs, and constructed computationally pseudorandom graphs that preserve many 
of those properties (see the final paragraph of Section |2ll. 

We consider replacing random graphs by A:-wise independent ones. The latter can be sampled and 
accessed using only poly{k log(A^))-bounded resources. This is achieved thanks to efficient constructions 
of discrete /c-wise independent variables by Joffe [26], see also Alon, Babai and Itai [JJ: the appearance 
of any potential edge in the graph is simply decided by a single random bit (that has probability p to 
attain the value 1). Such A;-wise independent graphs were used by Naor et al. 135 ] to efficiently capture 
arbitrary first-order properties of huge Q{N,p) graphs (see Section [331 ). 

1.2 k-Wise Independent Random Variables 

Distributions of discrete A;- wise independent variables play an important role in computer science. Such 
distributions are mainly used for de-randomizing algorithms (and for some cryptographic applications). 
In addition, the complexity of constructing fe-wise independent variables was studied in depth, and in 
particular, the aforementioned constructions 1261 [D (based on degree k polynomials over finite fields) 
are known to provide essentially the smallest possible sample spaces. Our work is, however, the first 
systematic study of combinatorial properties of A:-wise independent objects. Properties of various other 
A;-wise independent objects (mainly percolation on Z"^ and on Galton- Watson trees) were subsequently 
explored by Benjamini, Gurel-Gurevich and Peled iH. 
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1.3 The Combinatorial Structure of Random Graphs 



What are the principal attributes of random graphs that fc-wise independent ones should maintain? Most 
theorems that manifest the remarkable structure of random graphs state that certain properties occur either 
almost surely (a.s. for short), or alternatively hardly ever, (namely, with probability tending either to 1 or 
to as grows to oo). These results typically fall into one of the following categories. 

Tight concentration of measure. A variety of prominent random variables (regarding random graphs) 
a.s. attain only values that are extremely close to their expectation. For instance, random graphs (with, 
say, constant p) a.s. have connectivity number k = {i±o(i))pN, chque number c = 

(Bollobas and Erdos HOl, Matula (Ml, Frieze (221) and chromatic number x = (i±o(i)) ^2'iog(pjvf ^ 
(Bollobas IH, Luczak ll33l ). 



Thresholds for monotone properties. For a given monotone increasing graph property T, how large 
should p(A^) be for the property to hold a.s.? This question had been settled for many prominent prop- 
erties such as connectivity (Erdos and Renyi |[T4l ). containing a perfect matching (Erdos and Renyi |[T6l 
[TtKIII), Hamiltonicity (Posa [37], Korsunov 1291], Komlos and Szemeredi 1,30.1). and the property of con- 
taining copies of some fixed graph H (Erdos and Renyi flSl, Bollobas lEl). For these (and other) graph 
properties the sufficient density (for obtaining the property) is surprisingly small, and moreover, a thresh- 
old phenomenon occurs when by 'slightly' increasing the density from p{N) to p{N), the probability 
that T holds dramatically changes from o(l) to 1 — o(l)ll Thus, good emulation requires the property T 
to be guaranteed at densities as close as possible to the true Q {N, p) threshold. 

Zero-one laws. These well known theorems reveal that any first-order property holds either a.s. or 
hardly ever for Q{N,p). A first-order property is any graph property that can be expressed by a single 
formula in the canonical language where variables stand for vertices and the only relations are equality 
and adjacency (e.g. "having an isolated vertex" is specified by 3xVy-iEDGE(x, y)). These Zero-one 
laws hold for any fixed p (Fagin 119], Glebskii, Kogan, Liagonkii and Talanov {24]), and whenever 
p{N) = N~" for a fixed irrational a (Shelah and Spencer [|39J ). 



2 Our Contribution 

We investigate the properties of /c-wise independent graphs by providing upper bounds and lower bounds 
on the 'minimal' amount of independence, kx, required for maintaining the main properties T of random 
graphs. The properties considered are: connectivity, perfect matchings, Hamiltonicity, the connectivity- 
number, clique-number and chromatic-number and the appearance of copies of a fixed subgraph H. We 
mainly establish upper bounds on kx (where arbitrary /c-wise independent graphs are shown to exhibit the 
property T) but also lower bounds (that provide specific constructions of /c-wise independent graphs that 
fail to preserve T). Our precise results per each of these properties are discussed in Section[3l and proved 
in Section[5](and the Appendices). Interestingly, our results reveal a deep difference between fc-wise inde- 
pendence and almost /c-wise independence (a.k.a. {k, e)-wise independenc^. All aforementioned graph 
properties are guaranteed by A;-wise independence (even for small k = poly{log{N))), but are strongly 

^Namely, any property closed under graph isomorphism and under addition of edges. 

^Thresholds for prominent properties are often so sharp that p — {1 + o(l))p. Somewhat coarser thresholds were (later) 
established for arbitrary monotone properties by Bollobas and Thomason 1 11 1, and by Friedgut and Kalai 121]. 

'(fc, e)-wise independence means that the joint distribution of any k potential edges is only required to be within small 
statistical distance e from the corresponding distribution in the Q{N,p) case. 
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violated by some almost A;-wise independent graphs - even when k = A^^(^) is huge and e = A^^^(i) is 
tiny. For some properties of random graphs, T, our results demonstrate for the first time how to efficiently 
construct random-looking distributions on huge graphs that satisfy T. 

Our Techniques & Relations to Combinatorial Pseudorandomness. For positive results (upper bound- 
ing kx), we note that the original proofs that establish properties of Q{N,p) graphs often fail for k- 
wise independent graphs. These proofs use a union bound over M = 2®*^^) undesired events, by 
giving a 2~^^^^ upper-bound on the probability of each of these events|l| Unfortunately, there exist 
poZt/(log(A^))-wise independent graphs where any event that occurs with positive probability, has proba- 
bility > 2^°^^\ Therefore, directly 'de-randomizing' the original proof fails, and alternative arguments 
(suitable for the fc-wise independent case) are provided. 

In particular, many properties are inferred via a variant of Thomason's notion of 'jumbledness' BOl 
(mostly known in its weaker form as quasirandomness or pseudorandomness, as defined by Chung, Gra- 
ham and Wilson [13], and related to the so called Expander Mixing Lemma and the pseudo-random 
properties of graphs that follow from their spectral properties, see [21)- For our purposes, a-jumbledness 
means that (as expected in Q{N,p) graphs) for all vertex-sets U, V, the number of edges that pass from 
U toV should be p I [/ 1 1 y I lb a v|?7||V^. Jumbledness and quasirandomness had been studied extensively 
(see m i and its many references), and serve in Graph Theory as the common notion of resemblance to 
random graphs. In particular, Q{N,p) graphs are known to exhibit (the best possible) jumbledness pa- 
rameter, a = @{^/pN). One of our main results (Theorem[T]l demonstrates that A;-wise independence for 
k = 0(log(A^)) is stronger than jumbledness, in the sense that it guarantees the optimal a = Q{^/pN) 
even for tiny densities p = 9( ^°y^ ). Therefore, prominent properties of fc-wise independent graphs can 
be directly deduced from properties of jumbled graphs. 

Proving Theorem [T] exploits a known connection between jumbledness and the eigenvalues of (a 
shifted variant of) the adjacency matrix of graphs, following the approach in [2 ]. In particular, the analysis 
of Vu ([,41 1 . extending [20J) regarding the eigenvalues of random graphs is strengthened, in order to 
achieve optimal eigenvalues even for smaller densities p than those captured by BTI . This improvement 
implies, among other results, the remarkable fact that fc-wise independent graphs for k = 0(log(A^)) 
preserve (up to constant factors) the Q{N,p) sufficient density for connectivity. 

More on Techniques & Relations to Almost fc-Wise Independence. For negative results (produc- 
ing random-looking graphs that defy a given property T of random graphs), the f 23l [36l approach is to 
first construct some random-looking graph G, and later to 'mildly' modify G s.t. T is defied. This is 
done w.r.t. all graph properties considered here. For instance, the modification of choosing a random 
vertex and then deleting all it's edges violates connectivity while preserving computational pseudoran- 
domness. Unfortunately, such modifications fail to preserve A;-wise independence (the resulting graphs 
are only almost fc-wise independent). In contrast, most of our negative results exploit the fact that some 
constructions of fc-wise independent bits produce strings with significantly larger probability than in the 
completely independent case. This is translated (by the construction in Lemma |5]l to the unexpected ap- 
pearance of some subgraphs (in A;- wise independent graphs): either huge independent-sets inside dense 
graphs or fixed subgraphs inside sparse graphs. 

Comparison with Computational Pseudorandomness. Finally, fc-wise independence guarantees all 
random graphs' properties that were met by the (specific) computationally pseudorandom graphs of 1231 
l36l . In addition, only fc-wise independence captures (i) arbitrary first-order properties of Q{N,p) graphs, 

''For instance w.r.t. connectivity, M is thie number of clioices for partitioning the vertices into 2 disconnected components. 
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(ii) high connectivity, (iii) strongest possible parameters of jumbledness, and (iv) almost regular (1 it 
o{l))pN degree for all vertices, and (1 it o{l))p'^N co-degrees for all vertex pairs. Importantly, all 
this holds for any /c-wise independent graphs, (and in particular for the very simple and efficiently con- 
structable ones derived from |[26l lTll). whereas the |[23ll36]| 's approach requires non-trivial modifications 
of the construction per each new property. 

3 Combinatorial Properties of k-Wise Independent Graphs 

We now survey our main results per each of the aforementioned graph properties T. Typically our 
arguments establish the following tradeoff: the smaller p is, the larger k should be to maintain T. Given 
this tradeoff we highlight minimizing k or, alternatively, minimizing p. The latter is motivated by the 
fact that the Q{N,p) threshold for many central properties occurs at some p* <C 1. Minimizing p is 
subject to some reasonable choice of k, which is k < poly{log{N)). Indeed, as the complexity of 
implementing fc-wise independent graphs is poly{klog{N)), we get efficient implementations whenever 
k < poly (log{N)) even when the graphs are hugeandiV = 2?'°'2^(").| 

3.1 Connectivity, Hamiltonicity and Perfect Matchings (see Section |5^ 

The well known sufficient G{N,p) density for all these properties is ~ ^^7^- For connectivity, this 
sufficient density is captured (up to constant factors) by all log(A^)-wise independent graphs. Even k = 4 
suffices for larger densities p » N~2. Based on Hefetz, Krivelevich and Szabo's [,25,1 . Hamiltonicity 
(and hence perfect matchings) are guaranteed atp > with k > 41og(A''), and at p > N~2~^°^^^ 

with k > A. On the other hand, some pair-wise independent graphs are provided that despite having 
constant density, are still a.s. disconnected and fail to contain any perfect matching. 

3.2 High Connectivity (see Section 15.31) 

The connectivity number, k{G), is the largest integer, i, s.t. any pair of vertices is connected in G by at 
least i internally vertex-disjoint paths. Since a typical degree in a random graph is (1 it o{l))pN, it is 
remarkable that Q{N,p) graphs achieve k = (1 it o{\))pN a.s.. Surprisingly, such optimal connectivity 
is guaranteed by 0(log(A^))-wise independence whenever p > 0(^^^^^). 

3.3 Cliques and Independent- Sets (see Appendix |7]) 

For A^^°(^) < p < 1 — N°^^^ the independence number, /, of random graphs has a.s. only two possible 
values: either S* or S** + 1 for some S* ~ 2\og{pN) ^ rpj^-^ remarkable phenomenon is observed 

log(l/(l-p)) ^ 

to hold by Q{\og^{N)) -wise independence whenever p is bounded away from 0. On the other hand, 
/c-wise independent graphs are provided with k = Q ^ log'iogCAf ) ) where / > (S*)^^^^^) a.s. (for k = 
G(l), even huge A^^(i) independent-sets may appear). For smaller densities, random graphs a.s. have 
I < 0{p^^\og{N)), while 6 (log (A^)) -wise independence gives a weaker, yet useful, / < 0{^J~^) 

bound whenever p > ^}{ ^°^^^ ). By symmetry (replacing p with 1 — p), analogous results to all the 

above hold for the clique number as well. Discussing the clique- and independence-number is deferred 

to the appendices since the main relevant t echniques here are demonstrated elsewhere in the paper. 

^Accessing the graphs via edge-queries is adequate only when p > n~®'^' - otherwise a.s. no edges are detected by the 
polyin) inspecting algorithm. For smaller densities our study has thus mostly a combinatorial flavor. 
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3.4 Coloring (see Section |531) 



For ^ p < 1 — the chromatic number x of random graphs is a.s. (1 + o{l)) 2'iog(pAf) 

This Q{N,p) lower-bound on x is observed to hold for any (log(A^))®'^^)-wise independent graphs with 
moderately small densities p > (log(A^))^®(^). More surprisingly, k = ©(log(A^)) suffices to capture 
a similar upper-bound (even for tiny densities p = clog{N)/N). This upper-bound is based on Alon, 
Krivelevich and Sudakov's IS, H and on Johansson's 1271 . 

3.5 Thresholds for the Appearance of Subgraphs (see Section |5^ 

For a fixed (non-empty) graph H, consider the appearance of i7-copies (not necessarily as an induced 
subgraph) in either a random or a fe-wise independent graph. The Q {N, p) threshold for the occurrence 

of H sub-graphs lies at p*^ =^ N~p, where the constant p = p{H) is the minimum, taken over all 
subgraphs H' of H (including H itself), of the ratio (here, v{H') and e{H') respectively denote the 
number of vertices and edges in H'). Thus, no if-copies are found when p <^p* , while for any p ^ p*, 
copies of H abound (Erdos and Renyi fT5\, Bollobas [8]). For any graph H, this Q{N,p) threshold holds 
whenever k > Q{v^{H)), but as k is decreased to [|J, the G{N,p) threshold is defied: much sparser 
graphs exist where p <^ p*^ and yet copies of H are a.s. found. In particular, when e{H) > D.{v'^{H)), 
the threshold violation occurs at A; = il.{v{H)). 

3.6 First Order Zero-One Laws (Previous Results) 

Naor et al. |f35l have recently studied capturing arbitrary depth-D(A^) properties. These are graph proper- 
ties expressible by a sequence of first-order formulas <I> = {(/lAf Iatgn. with quantifier depth depth{cl)j\^) < 
D{N). A 'threshold' depth function D* ~ iog^(]yp) was identified s.t. arbitrary A:-wise independent 
graphs resemble Q {N, p) graphs w.r.t. all depth D* properties. The underlying resemblance-definition is 
in fact so strong, that even Q{N,p) graphs cannot achieve resemblance to themselves w.r.t. properties of 
higher depth. On the other hand, fc-wise independent graphs were shown to defy some G {N, p) properties 
of depth Q{\/k\og{N) +log(A^)). These results are incomparable to the ones in the current paper, since 
most of the graph properties studied here require larger depth than D* . 



4 Preliminaries 

Asymptotics. Invariably, A; : N ^ N, while p, e, (5, 7, A : N — > (0,1). We often use A;,p, e, 7, A 
instead of k{N),p{N),e{N),6{N),'y{N),A{N). Asymptotics are taken as ^ 00, and some in- 
equalities hold only for sufficiently large A^. The [-J and [•] operators are ignored whenever insignificant 
for the asymptotic results. Constants c, c are not optimized in expressions of the form k = clog(A^) or 
p = {log{N)y/N^, whereas the constant A is typically optimized. 

Subgraphs. For a graph H, let v{H) and e{H) denote the number of vertices and edges in H. For 
vertex sets U, V let e{U, V) denote the number of edges that pass from U to V {if S = U f]V ^, then 
any internal edge of S is counted twice). Similarly, we let e{U) = e{U, U). 

Random and k-Wise Independent Graphs. Throughout, graphs are simple, labeled and undirected. 
Given N,k,p as above then g''^^\N,p{N)) (or Q^{N,p) for short) denotes some distribution over the 
set of graphs with vertex set {1, N}, where each edge appears w.p. p{N), and the random variables 
that indicate the appearance of any k{N) potential edges are mutually independent. We use the term 
'fe-wise independent graphs' for a sequence of distributions {Q^{N,p)} N&i indexed by A^. 
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Almost Sure Graph Properties. A graph property T, is any property closed under graph isomorphism. 
We say that 'T holds a.s. (almost surely) for Q^{N,p)'' or that (abused notation) 'T holds for Q^{N,py 

whenever Prgfe(;Y,p) [^] '^-^^ 1. Similar terminology is used for Q {N, p) graphs. 

Monotonicity in (k, p). Since fc-wise independence implies fc-wise independence for all > A; we 
may state claims for arbitrary k > k' but prove them only for k = k'. When establishing monotone 
increasing properties we often state claims for arbitrary p > p' but prove them only for p = p'. The latter 
is valid since for any N,k,p > p', the process of sampling from any (independent) Q^{N,p), Q^{N,p' /p) 
distributions and defining the final graph with edge-set being the intersection of the edge-sets of the two 
sampled graphs, clearly results in a Q^{N,p') distribution. 

k-Wise Independent Random Variables. The term '(M, A;,p)-variables' stands for any M binary 
variables that are fc-wise independent with each variable having probability p of attaining value 1 . Lemma 
[I (proved in Section 16.11) adjusts the known construction of discrete /c-wise independent variables of 
ll26l . |[T2l . ifTl to provide (M, A;, p)- variables that induce some predetermined values with relatively high 
probability. Throughout, ei and eo resp. denote the number of edges and non-edges in a graph H. 

Lemma 1 Given < p < 1 with binary representation p = 0.bi...bi, and natural numbers eo,ei,M 
satisfying cq + ei < M, let F = max{2r'°S2 Afl ^ 2^}. Then there exists {M, k, p)-variables s.t. Y'i[A] = 
F~^, where A denotes the event that the first cq variables receive value while the next ei variables 
receive value 1. 



Tail Bounds for k-Wise Independent Random Variables. The following strengthened version of 
standard tail bounds (proved in Section [d!2l ) translates into smaller densities p for which monotone graph 
properties are established for A;-wise independent graphs. 

Lemma 2 Let X = "^fLi -^j ■^"'^ ofk-wise independent binary variables where Pr[-^j = 1] = fM 

holds for all j. Let 6 > 0, and let k be even s.t. *^r^ ju(l — ^) > 1. Then 



Pr[|X -E(X)| > 5E{X)] < 



2k{l- fi) 



5 The properties of k-wise independent graphs 
5.1 Degrees, Co-Degrees and Jumbledness 



Lemma 3 (Achieving almost regular degrees) In all k-wise independent graphs {Q^{N,p)}N^fi it 

- 3k 1 



a.s. holds that all vertices have degree p{N — 1) (1 it e) whenever N [-^r^] ^^^"^^ — > 0, and in particular 



when either 



L k>4, < p < 1 _ 5 1 > g ^ ^-1/2^-1/4. 



or 



2. k>Alog{N),^^^<p<l-'-^,andl>e>^^^^. 

Proof. Fix a vertex v, and let be the random variable that indicates the appearance of the edge {v, w} 
in the graph. Thus, the degree of v is X = X^w^^ -^w Since X is the sum of (A^ — 1, variables, 
Lemma|2]implies that the probability that v has an unexpected degree X / p{N — l){l±e) is bounded by 
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l-^^v] ■ Applying a union-bound over the possible vertices v, gives that the probability of having 

r '^h n I 1 

some vertex with unexpected degree is bounded by [75^ J > which vanishes for the parameters in 
items 1 and 2. ■ 

Lemma 4 (Achieving almost regular co-degrees) In all k-wise independent graphs {Q^ {N, p)}7veN it 

a.s. holds that all vertex pairs have co-degree p'^{N — 2)(1 it 7) whenever either 

1. k> 12, iV-5 <^ p < 1 - ^, and I > 7 >p"^iV"S; or 

2. > 12 log(iV), ^I^MAO < ^ < 1 _ laiogW andl>^> 

Proof. The proof is completely analogous to that of Lemma [3l Here the union-bound is over all (^) 
vertex pairs {u, u}, and the co-degree of each {u, v} is the sum of (A'^ — 2, [|j ,p^)-variables. ■ 

The following definition is a modified version of the one in ||40l[T3l . see also HI and lH, Chapter 9. 

Definition 1 (Jumbledness) For vertex sets U, V, let e{U, V) denote the number of edges that pass 
from U to V (internal edges of U are counted twice). A graph is {p,a)-jumbled if e{U,V) = 
p\U\\V\±ay/\U\\V\ holds for all U,V. 

Theorem 1 (Achieving optimal jumbledness) There exist absolute constants ci, C2, C3 s.t. all k-wise in- 
dependent graphs {^'^(A^,p)}7V6N <^re a.s. (p, a)-jumbled whenever either: 

1. k>A,p> VL{jj) and a > ^N^/^; or 

2. k> \og{N), ^iif^ <p<l- ^^iiHslM and a > csVpN. 

Proof. The proof is based on spectral techniques and combines some refined versions of ideas from f2l|, 
|[20]| and pT]. using the fact that traces of the A;-th power of the adjacency matrix of a graph are identical 
in the fc-wise independent case and in the totally random one. The details are somewhat lengthy and are 
thus deferred to Appendix [H 



5.2 Connectivity, Hamiltonicity and Perfect Matchings 

Theorem 2 (Achieving connectivity) There exists a constant c s.t. the following holds. All k-wise inde- 
pendent graphs {G''{N,p)}]\f(=f^ are a.s. connected whenever either: 

• k > 4 and p » or 

• k> 41og(iV) and p > 

Proof. Let [/ be a vertex-set that induces a connected component. Connectivity follows from having 
\U\ > 0.5N for all such U. The following holds a.s. for Q^{N,p). By Lemma [3l all vertices have 
degree > 0.9piV, so e{U) > 0.9pN\U\. By Theorem [B all sets U satisfy e{U) < p\U\'^ + a\U\ with 
a = 0{^/pN) = o{pN). Re-arranging gives (0.9 - o{l))N < \U\. ■ 

Theorem 3 (Achieving Hamiltonicity) All k-wise independent graphs {Q^{N,p)}NeN cire a.s. Hamil- 
tonian (and for even N contain a perfect matching) whenever either: 

• /c > 4 and p > ; or 
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• k> 41og(iV) and p > i^4r^- 

Proof. Let T{V) denote the set of vertices v ^ V that are adjacent to some vertex in the vertex-set V. 
By Theorem 1.1 in Hefetz, Krivelevich and Szabo's |[25]| . Hamiltonicity follows from the existence of 
constants 6, c such that a.s. (i) |f > 12|y| holds for all sets V of size < bN, and (ii) e{U, V) > I 
holds for all disjoint sets U, V of size i^^j^- We remark that (unhke other asymptotic arguments in this 
paper), the sufficiency of (i) and (ii) might hold only for very large N. For (i), let b = and consider 
an arbitrary set V. By Theorem [TJ a.s. all vertex-sets T have e(T) < p\T\'^ + o{pN)\T\. By Lemma 
[3] a.s. all the degrees are (i±o{i))pN, so exactly {i±o{i))pN\V\ edges touch V (where internal edges are 
counted twice). Let T = F U r(T^), and assume that |f(y)| < 12\V\. We get {i~o(i))pN\V\ < e{T) < 
p{13\V\)'^ + o{pN)\V\. Re-arranging gives \V\ > Condition (i) follows. For (ii), by Theorem 
[U a.s. all (equal-sized and disjoint) vertex-sets U, V have e{U, V) > p\U\\V\ — 0{^/pN)\U\. If there 
is no edge between U and V, then e{U,V) = 0. Re-arranging gives \U\ < 0{\/N/p) < 0{j^^^^). 
Condition (ii) follows. ■ 

Theorem 4 (Failing to preserve connectivity) There exist pair-wise independent graphs {Q'^ {N, p) } Af gpj 

where p = 1/2, that are (i) a.s. disconnected (and contain no Hamiltonian cycles), and that (ii) contain 
no perfect matchings with probability 1. 

Proof. Consider the graphs defined by partitioning all vertices into 2 disjoint sets Vo,Vi where each 
Vj induces a clique, no edges connect Vq to Vi, and Vi is chosen randomly and uniformly among all 
subsets of odd cardinality of the vertex set. Note that for every set of 4 vertices, there are 16 ways to 
split its vertices among Vq and Vi, and it is not difficult to check that if > 5, then each of these 16 
possibihties is equally likely. Therefore, any edge appears w.p. i, and any pair of edges (whether they 
share a common vertex or not) appears w.p. j. Still the graph is connected iff all the vertices belong to 
the same Vj which happens only w.p. 2~^+^ (and only if is odd). Since | Vi| is odd, the graph contains 
no perfect matching. ■ 

Note that when p is slightly increased to 1/2 + N^^^^\ then 4-wise independence suffices for 
achieving Hamiltonicity (via Dirac's Theorem), because then a.s. all vertices have degree > N/2. 

5.3 High-connectivity 

Theorem 5 (Achieving optimal connectivity) There exists an absolute constant c, s. t. for all k-wise in- 
dependent graphs {G'^{N,p)}]y^f^ the connectivity number is a.s. {i±o(i))pN when either 

• k > 4 and p » N~s; or 

• k> \og(N) and p > c^P-. 

Proof. The connectivity is certainly not larger than {\ + o{l))pN , as it is upper-bounded by the minimum 
degree. By Theorem 2.5 in Thomason's [4(Jl k > d — a/p holds for any (p, a) -jumbled graph with 
minimal degree > d. Thus, achieving k > pN, reduces to obtaining (i) d = (1 it o(l))pN, and (ii) 
a <C pd. Condition (i) a.s. holds by Lemma|3] By Theorem[T] we a.s. achieve ( C3 y/ pN ) -j umbledness 
for some constant C3, so condition (ii) becomes p'^N ':$> ^JpN. This proves the first part of the theorem. 
To prove the second we note, first, that we may assume that p ^ 1 (since otherwise 4-wise independence 
suffices). Let 5" be a smallest separating set of vertices, assume that \S\ is smaller than (1 — o{l))pN, 
let U be the smallest connected component of G — S and let W be the set of all vertices but those 
in [/ U S*. Clearly \W\ > {\ - o{l))N. Note that e{U,W) = 0, but by jumbledness e{U,W) > 
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- C3y^pN\U\\W\. This implies, using the fact that \W\ > N/3, that \U\ < Using 

jumbledness again, e{U, S) < p\U\\S\ + C3y^pN\U\\S\ but as all degrees are at least (1 — o{l))pN, 
e{U,S) > (1 - o{l))pN\S\ - e{U) > (1 - o{l))pN\U\ -pjt/p - C3VpN\U\ > \U\{1 - o{l))pN, 



where here we used the fact that \U\ < 0{l/p) and that ^/pN = o{pN). This imphes that either 
p\U\\S\ > i|[/|pA^, implying that \S\ > N/2 > pN, as needed, or C3^pN\U\\S\ > i|C/|piV, implying 
that l^l > ^\U\pN which is bigger than pN provided \U\ > 9cn. However, if |?7| is smaller, then 

surely l-SI > (1 — o{l))pN, since all degrees are at least (1 — o{l))pN and every vertex in U has all its 
neighbors inU U S. M 

5.4 Thresholds for the Appearance of Subgraphs 

For a fixed non-empty graph H, let p{H) and pj^ be as in Section [331 

Observation 1 (Preserving the threshold for appearance of sub-graphs) There exists a function D{v) = 
{i±o(i)) Yg s.t. for any graph H with at most vvertices, and for all k-wise independent graphs {Q {N,p)}]\[^^ 
with k > D{v) the following holds. Let A denote the event that H appears in Q^{N,p) (not necessarily 
as an induced sub-graph). Then 

• IfpiN) < p^iN) then (^A) a.s. holds. 

• IfpiN) » p*HiN) then A a.s. holds. 

Proof. The proof (given in Appendix 16.31 ) applies Rucinski and Vince's |[38l to derive a lower-bound on 
the minimal k sufficient for the original Q [N, p) argument to hold. ■ 

Theorem 6 (Defying the threshold for appearance of sub-graphs) For any (fixed) graph H that sat- 
isfied p{H) < 2, there exists k-wise independent graphs {G''{N,p)}]\j^^ where k = l'^;^ ~ ll ^'^^ 
p(N) <C p*fj{N) s.t. H a.s. appears in Q^(N,p) as an induced sub-graph. 

Proof. Theorem [6] relies on Lemma \5\ This lemma considers the appearance of the sub-graph H]\i in 
Q^{N,p) where {Hn}^^^ is any sequence of graphs (possibly) with unbounded order. 

Lemma 5 (fc-wise independent graphs with unexpected appearance of sub-graphs) Let {ffTvlTVeN 
be a sequence of graphs where has exactly S{N) < \/N vertices, ei{N) edges and eo{N) none- 
edges. Assume that for each N there exists (('^^^^) , k{N) , p{N))-variables s.t. with probability A(N) ^ 
{S{N) /N)"^ it holds that the first eo{N) variables attain value and the next ei (N) variables attain value 
1. Then there exist k-wise independent graphs {Q^{N,p)}j\^(zfq that a.s. contain HN-copies as induced 
sub-graphs. 

Proof (Lemma mi. Fix N,so H = Hn,S = S{N),ei = ei{N),k = k{N),p = p(iV), A = A{N). 
We construct graphs Q^{N,p) that a.s. contain H copies. Given the N vertices, let {Vj}j^i be any 
maximal collection of edge-disjoint vertex-sets, each of size \ Vj\ = S. For each j, decide the internal 
edges of Vj by some ((2), A;, p)- variables s.t. H is induced by Vj with probability A. This can be done 
by appropriately defining which specific edge in Vj is decided by which specific variable. Critically, the 
constructions for distinct sets Vj are totally independent. The R = (^) — M (2) remaining edges can be 
decided by any (R, /i;,p)-variables. The resulting graph is clearly A;-wise independent. 

*This condition rules out only graphs H that are a collection of disjoint edges. For such graphs p{H) — 2, so clearly no 
iZ-copies can be produced (even if = 1) when p{N) ^ p'^{N) — N~'^. 
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The main point is that (i) the events of avoiding ff-copies on the various sets Vj are totally indepen- 
dent (by the edge-disjointness of the Vj-s), and that (ii) in our fc-wise case A is rather large (compared 
with the totally independent case). Thus, avoiding ff-copies on any of the Vj-s is unlikely. Indeed, let 
B denote the event that no //-copies appear in the resulting graph, while B' only denotes the event that 
none of the Vj-s induces H. By Wilson's [43J and Kuzjurin's [32] we have M = Q{N'^/S'^), so 

Pr [B] < Pr [B'] = (1 - A)^'^ < e~®(^) , 
which vanishes by our requirement that A » (S/N)"^. ■ (Lemma|5]) 

Completing the proof of Theorem |6l For v = v{H),p = p{H),p* = p*^, and some 1 ^ f{N) < 
N°^^\ define p s.t. p~^ is the minimal power of 2 that is larger than, As desired p <^ p*. Let ei and 
eo respectively denote the number of edges and non-edges in H. With M = (g) and F = 1/p, we apply 
Lemma[T]to produce (M, fc,p)- variables s.t. with probability > F^'^ the first eo variables have value 0, 
and the remaining ei variables have value 1. By Lemma|5l the latter immediately implies the existence of 
A;-wise independent graphs that a.s. contain //-copies as long as F'^ <ti (N/v)'^. As F = 1/p = NP^°^^\ 
this <^ requirement translates to kp <2. M (Theorem^ 



5.5 The Chromatic Number 

Observation 2 (Preserving the chromatic-number lower bound) For any c > there exists some 
d > 0, s.t. all k-wise independent graphs {G^{N,p)}N(zfq with (\og{N))^^ < p < I — N~°^^^ and 
k > d(log(A^))'^^^ a.s. have chromatic number x > ^2°og^pjV)^^ • 

Proof. Let /(G) denote the independence number of (a single) A^-vertex graph G. Clearly, x{G) > 
so observation [2]follows from our k-wise independence upper-bound on I (observation |3l). ■ 

Theorem 7 (Preserving the chromatic-number upper bound) There exists an absolute constant c 
s.t. the following holds. All k-wise independent graphs {Q''{N,p)}NeN withp < 1/2 a.s. have chromatic 
number x < ^^io|(piV) whenever either: 

L k > 12 and p > N^n; or 

2. k> log{N) andp > c-^^. 

Remark. No special effort was made to optimize the constants ^ and 

Proof (sketch). Since p is bounded from above and log(l/l — p) — > pj ln(2), it suffices to show that 
a.s. X < G^ \o^{pN) )• ^'■^"^ ^ based on Alon, Krivelevich and Sudakov's 13]. Specifically, choose 
b = 1/25, s.t. by item 1 in Lemma [3] (with e = {log{N))p^^/'^N~'^/^) and by item 1 in Lemma |4] 
(with 7 = {log{N))p^^N'^/^), a.s. all the degrees are lower bounded by pN{l - p~V2j\r-3/8+o(i)^ > 
pN - N^-'^^ and all co-degrees are upper bounded by p'^N{l + p-iiV-i/6+o(i)) < _ ]<^i-iS ^ gy 

Theorem 1.2 in 13], these conditions (with 5 < 1/4 andp > iV^i) imply that x < < 0( iog^(^jv) )- 

Item 2 follows from jumbledness and the main result of [4J (which is based on 1,27.1 ). by which any 
graph with maximum degree d in which every neighborhood of a vertex contains at most d^~^ edges (for 
some constant P) has chromatic number x < ^(lEfd)- ' 
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6 Appendix - Detailed Proofs 



6.1 Modified Construction of k-Wise Independent Variables - Proving Lemma U 

Recall that given any prime power F, the original |[26l [T2l [H construction considers the field F with 
elements {0, ...,F — 1}, and for each element j G F, a random variable Zj is defined, s.t. the Zjs are k- 
wise independent, and each Zj is uniformly distributed in {0, F — 1}. We derive from those Zj-s some 
(M, k, p) binary variables Xj, by setting (i) Xj = 1 iff > 1—pforj = 1, cq, and (ii) Xj = 1 iff 
< p for j = Co + 1, Co + ei- Evidently, the Xj-s are A;-wise independent with Pr(Xj = 1) = p. 

Recall that Zj =^ with Q being a uniformly random degree k polynomial over F, and let B denote 
the event that the 0-polynomial was chosen. Since B implies A, we get Fr[A] > Fr[B] = F~^. ■ 

6.2 k-Wise Independence Tail Bound - Proving Lemma |2] 

Let Xi =^ X^- n and X =^ Y^iU Xi, so X - E{X) = X. Thus, 



Ft[\X - K{X)\ > 5E{X)] = Pr[\X\ > 6E{X)] 
= Pt[X'' > {6E{X)f] < ^^^^^ 



{5E{X)Y 



the last equality holds for any even positive k, while the < employs Markov's inequality. 
We bounds = E{X'') using the expansion 



where D = {d = {di, ...,dA/)| E»=i di = k,di> 0}. The k -wise independence of the variables Xi, 
guarantees the fc-wise independence of the XjS, so 

s = ^ E(nfiiXf ) = nf£iE(xf ). 

d€D deD 

Next, since E(Xj) = we can ignore all terms where di = 1 for some i. Namely, we consider only terms 
n = n^^-^E(X.^''^) where for some J < |, it holds that precisely j variables appear and for each variable 
Xi^ in n we have d,^ > 2. Hence, 

k 

whenever bounds the contribution of all terms 11 with precisely j variables. 
Strengthening standard versions of the inequality begins by taking 

= Q:')/Ki-/^)F- (2) 
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Indeed, clearly bounds the number of terms IT with precisely j variables, while — fi)]^ bounds 

the expectation of each term n that has precisely j variables because 



< ^(i-^)[(i-^)i + ^i] = ^(i-^) 

(the final < applies the facts 0</x,l — /x<l and d>2). Thus, multiplying over the j terms gives 



(3) 



Observe that is maximized when j = |. Indeed, 



M-j fj + l 



vtr . 



> 



J + 1 
M-k 
k 



J 



H{1 - n) 



(the concluding > 1 holds by the lemma's assumption). 
Thus, the maximal ^'j is 



2 



< 



< 



M 
k 

2 



7T W-I^)V- 



(I)! V2 
(eM)l 

v^(l) 

sf^k 



I) [Mi-M)]^ 



(Stirling's approximation for (|)! implies the last <). 
To summarize, all the above gives 

Pr[|X-E(X)| > (5E(X)] < 



(fc/2) 



{5¥.{X)f 

^(iMMl-/.))i 



< 



< \ - 









fk 


r§fc(i-M)l 


k 
2 


V TT 







The Lemma follows as it can be directly shown that for all k 



k 
2 

< 22. 
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6.3 Appearance of Subgraphs - Proving observation [T] 

We first consider only balanced graphs H, namely graphs where p{H) < p{H') for any subgraph H' C 
H. The original ^(A^, p)-threshold proof fT5^ takes a fixed graph F as a parameter, and considers for 
each set T of v{F) distinct vertices the random variable Y,f which indicates whether T spans F in the 

resulting graph. Thus =^ Y2t counts the number of sets that span F. 

First, the authors of |15l consider a specific subgraph H' C H s.t. p{H) = and show that 

p <^ p*jj implies that E,{Y^') <^ 1. In this case, H' rarely appears in Q{N,p) graphs and so does H. 
On the other hand, whenever p » p*^, they show that E(y^) » 1 and by Chebyshev's inequality it 
is deduced (only here the fact that H is balanced is used), that some iZ-copies appear. Thus, the entire 
argument applies only probabilities regarding either a single variable Y^ , or a pair Y^ ,Y^, of variables, 
and relies only upon the independence of sets of m < 2("^^^) edges. 

For non-balanced graphs the p <^ p*fj part holds as for balanced ones. For p ^> p*^, we rely on the fact 
that for any graph H, there exists an extension graph H C H" s.t. H" is balanced and p{H") = p{H) 
(Rucinski and Vince [38]). Since p » N^f^^^ means that p 3> N^p^^ \ and since H" is balanced, 
then Q{N,p) graphs a.s. contain copies of H" , and copies of H appear as well. This time the Chebyshev 
argument assumes only the independence of sets of m < 2("^^ ^) edges. Since by ||38l there exists H" 

as above with v{H") < (1+0(1)) ^"^^^^ , then m = (i=bo(i)) ^"^^^^ suffices. ■ 

7 Appendix - The Independence Number of k-Wise Independent graphs 

The following positive result follows the argument used to establish observation [T] 

Observation 3 (Preserving random graphs' precise independence-number) Consider arbitrary k- 
wise independent graphs {Q''{N,p)]N&i where A^-°(^) < p{N) < 1 - N-°'^'^\ and let I{N) = 
I[Q^[N,p)) denote the independence number of Q^{N,p). Then there exists a function S*{N,p) = 

(i-°(^)) iog('i/(i-i)) ' ^^^(^) ^ S*{N,p)+2, thena.s. I{N) < S*{N,p) + l, andifk{N) > (^*(^'^)), 
then a.s. I{N) > S*{N,p). 

Proof. The classical proof ( lITOl . ll34l ) of this claim for Q{N,p) graphs considers for each set T of 
S distinct vertices (5 being a parameter) the random variable which indicates whether T spans an 
independent set in the resulting graph. Thus 1""^ = Yr^ counts the total number of independent sets 
of size S. It is shown that for 5 = S"* + 2 then E(y'5) <C 1 so a.s. the independence number < 5"* + 1. 
On the other hand, for S = S* then K(Y'^) » (1), and by Chebyshev's inequality it is deduced that 
a.s. some independent sets of size S* appear. This entire argument considers only probabilities regarding 
either a single variable Y^ (for the lower- and upper-bound on /), or a pair Y,^ , Y^, of variables (for the 
lower-bound). Therefore, the upper-bound holds for all fe-wise independent graphs with k > S* + 2, and 
the lower-bound holds whenever A; > 2('^ ). ■ 

We next provide our negative results. Since the complexity of known constructions of /c-wise inde- 
pendent variables critically depend on the length, l{p), of the binary representation ofp = 0.bi...b£, it is 
reasonable to focus on densities with bounded length. The argument used here was already applied in the 
context of Theorem [6l 

Theorem 8 (K-wise independent graphs with huge independent sets) Let 5, : N — > and 

p : N ^ (0, 1) satisfy S{N) < Ar(wn-) and l{p{N)) < 2 log {S{N)). Then there exist k-wise inde- 
pendent graphs {Q^{N,p)}N^^ that a.s. contain independent sets of size S{N). 
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Proof. By Lemma[Il since i{p) < 21ogS', we get ((f), A;, p)- variables s.t. the probability that all vari- 
ables receive value is A > 5'"^'^. From this, Lemma|5]gives fe-wise independent graphs that a.s. contain 
independent-sets of size S, whenever S'^ <^ AN'^ . ■ 

Corollary 1 Let {S,k,p) be as in Theorem^ with il(l) < p{N) < 1 - N^°^'^\ and with S* as in 
observation\3\ Fix c > 1. Then there exist k-wise independent graphs {^'^(A^,p)}jveN where k{N) > 
(i-o(i)) eiog^il^(jv) that a.s. contain independent sets of size S » {S*{N)y. 

Proof. It suffices to provide an integer S s.t. : (i) S ^ {S*Y (the desired outcome) and (ii) S <^ 

k + l ) 

(the sufficient condition for applying Theorem[8]l. Such S clearly exists as long as 

{S*f (4) 

Define r by S* = W . Since N^^^) = 2, then any choice of f{N) > 1 gives A^iosC^') > 1. Thus, 
equation (HJl translates to having (for some f{N) » 1) 

cr<^-l^ (5) 
- k + l \og{N) ■ 

1 

Since p is bounded from 0, then S* < 0{log{N)) so (again using N^°s(n) = 2) 

clog(5*) ^ cloglog(iV) + 0(l) 
log(Ar) - log(iV) 

All this is valid in particular when 1 <^ f{N) <^ log log(A^), so equation ^ becomes 

1 ^ f{N) cloglog(Af) _ 

>cr+ / \' = (i+o(i))- ^ - 



k + l - log(iV) log(iV) 

The following upper-bound for the independence-number is larger than the bound of observation [3l 
yet holds for significantly smaller densities p. 

Theorem 9 (Independence-number upper bound) There exist constants ci, C2 s.t. for any k-wise in- 
dependent graphs {Q''{N,p)}NeN the following a.s. holds. There are no independent-sets of size S 
whenever either: 

1. S" > p-i/2iV3/4, k>4 and p^ N-^'"^; or 

2. S>ci^, k> log(7V) and p > H^M^. 

Proof. By TheoremlH a-jumbledness is a.s. achieved. For item 1 we have a 3> ^N^^'^. For item 2 we 
have a = 0{^JpN). Then, any vertex set U satisfies e{U) > p|J7p — a\U\, so if U is independent, then 
\U\ < a/p. m 
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8 Appendix - k-wise independence guarantees optimal jumbledness 



This appendix is dedicated to proving Theorem [T] Given an A'^-vertex graph G, consider the complete 
graph G, with weight 1 — p on any edge that appears in G, and weight —p on any other edge and on any 
self loop. Let A = A{G) denote the corresponding N x N matrix where Au^w = I — p whenever u, w 
are adjacent in G and Au^w = —p otherwise (including the case u = w). 

Let A = A(G) denote the largest eigenvalue in absolute value of A. By the argument in [2] G is 
A-jumbled. Indeed, for any two sets of vertices U and W, if we let xu and xw denote the characteristic 
vectors of U and W, respectively, then x\jAxw = e{U, W) — p\U\\W\ and as the ^2-norm of Axw is 
at most Ai/|W^, and that of xu is ^/M' it follows by Cauchy-Schwarz, that \e{U, W) - p\U\\W\\ = 
\xIjAxw\ < Ay^|C/||1^|, as needed. 

Let r = {vq vi ^ ■ ■ ■ vr = vq) be an arbitrary closed walk with R steps in G. Throughout, 
r may repeat vertices and edges and may traverse self-loops. Let VF(r) = Ylf=o ^Vj,vj+i, and let 

By Wigner's trace argument ll42l . for any graph distribution, and any even R> A and » 1, a.s. 

A < {ujE{X))^/^ (6) 

(E(-) stands for expectation). Thus, establishing the desired jumbledness reduces to bounding E = E(X). 
We first fix t, j, and bound the contribution to E of a single walk, T, that traverses exactly t vertices and 
j edges; later we bound the number of walks with such t,j. Let {ei, ej} denote the set of all edges 
(excluding self-loops) used by T, where is traversed precisely qi > I times (we don't care how many 
times ej = {u, w} is traversed specifically from u to w or from w to n). As long as k > R then the 
contribution of T to E is bounded by E'(r) = IlLibll " P)'^' + (1 - p){-py']- The latter equals if 
some Qi = 1, so we focus on walks where each is traversed at least twice. Then, 

j 

E{r) < Hipii - pf + (1 - p){-pf] < p^. (7) 

i=l 

Proving Theorem [IJ item 1. Let k = R = 4. There are only 2 types of walks: Walks with 3 vertices 
contribute 0{p'^N^) to E, and walks with 2 (or 1) vertices contribute only 0{pN'^), which is dominated 
by the 3-vertex walks' contribution. By for any a; » 1 a.s. A < {uj{p'^N^)Y/^ . ■ (Item 1) 

Proving Theorem [H item 2. We adopt the approach of Fiiredi-Komlos-Vu ll20ll41] . who bound the 
number of walks with given (t, j), by encoding the walks in a 1:1 manner, and then bound the number of 
code- words. We first describe their encoding scheme (Section ISTI ) and later refine it (Section [8^ . 

8.1 The Fiiredi-Komlos-Vu encoding 

Fix r and consider the spanning-tree T of P, which consists of all the vertices visited by F and exactly 
those edges through which F visits a vertex for the first time. Edges (and consequently, steps) in F are 
either internal (e € T), or external (e ^ T). A step leading to a new vertex is called positive. A step 
traversing an internal edge for the 2'nd time is called negative. Any other step is called neutral (thus, 
all (-I-) steps are internal, and neutral steps are either external, or pass through some internal edge for the 
i'th time i > 3. Steps on self-loops are external). The encoding of F is composed of: 

• A list of all t vertices visited by F, ordered by their first appearance. 

• A string of length R, where the i'th position encodes the i'th step as follows. 
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• Each positive step is encoded by (+). 

• Each negative step is encoded by (-). 

• Each neutral step (n v) is encoded by (v). 

How is r retrieved from its encoding? The starting vertex is known, since the order in which the 
vertices appear in T is known. Assuming that the current position in the walk is known, then the next 
position is also known if the next step is either neutral or positive. Ambiguity is possible only when we 
are about to traverse a (-) step, and in addition the walk is currently at a critical vertex x. This means 
that the number of internal edges ei, that touch x, and have been traversed exactly once (up to this 
point) is > 2. For example, consider a walk starting with 1^2^3— >1^4^5— >1. At this 
point, x = 1 is critical since both edges ei = {1,2} and 62 = {1,4} were traversed exactly once. If a 
(-) step immediately follows, it is not clear to which ej this current (-) refers. The encoding in ll20ll4TI is 
modified in some way s.t. critical steps can be decoded un-ambiguously and the entire encoding-scheme 
becomes 1: 1, as desired. By Theorem 1.5 in ll4n this suffices for proving Item 2 in our theorem whenever 

For smaller p, we must refine the original encoding significantly. 

8.2 Our refined encoding 

We start with a simple observation. Throughout the analysis we set A; = i? = log(A^), and let i count 
the number of external edges in F. Let $i,$^,<I>j, resp. denote the contribution to E of all walks 
with exactly t vertices, or exactly £ external edges, or exactly t vertices and i external edges. Clearly, 
E = ^t, E = ^£ and E = J2t e ^t- Since any of these sums has poly{log{N)) summands, and 
since R = log(iV) then (E)^/^ = (1 + o(l))<l>^/^ whenever $ bounds the maximal term among all 
$f. It therefore suffices to show that ^ < {0{\/pN))^. 
We now give a high level description of our improved analysis. We keep 3 ingredients from ||20ll4T]| : 
(i) The entire (ordered) list of vertices is provided. This contributes a Q{N^) multiplicative term to the 
bound on E. (ii) Specific steps are encoded by a symbol from a fixed alphabet (the original alphabet is 
{+, —, neutral}; our final alphabet will be slightly larger). This contributes a (0(1))^ multiplicative 
term to E. (iii) Since T traverses at least t — 1 edges, equation (|7]) bounds the contribution of each walk 
to E by p^^^. Thus, the combined contribution of (i)(ii) and (iii) to the bound on A becomes (after taking 
the i?'th-root) 

e((piv)*/^). (8) 

The latter partial encoding of (i)+(ii) is not 1:1, because of the neutral and the critical steps. Recall 
that there are {t — l) edges in T, and since (as mentioned above) all edges are traversed at least twice, then 
r has exactly {t — 1) (+) steps, exactly {t — 1) (-) steps, and m = R — 2{t — 1) neutral steps. In ||20ll4n . 
the trivial bound is used for the contribution of the neutral steps to the total number of code-words. We 
strengthen |[20ll4T1l . mainly by the following observations. First, we note that whenever i is 'very large', 
then the entire contribution of all such paths to A is negligible. Next (and perhaps most significantly), we 
show that whenever i is not 'very large', the following holds: (i) Half of the neutral steps can be encoded 
very economically, reducing the t*" term from |[20ll4l1l into roughly t^-^^_ (u) All critical steps can be 
encoded so economically, that their entire contribution is (almost) dominated by that of the neutral steps. 
Consequently, as t < 0{pN), we conclude that A is bounded by (roughly) Q{pN)R x 0{pN)2R (the 
first term stems from equation (H), the second from the neutral steps). Since ^ ~ 0.5 — the latter 
bound becomes 0{^JpN), as desired. Details follow. 
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Handling the 'non-typical' walks (large i). Clearly, the contribution of all {t, £)-walks to E is bounded 
by B = p^{pNY~^t^N. Indeed, N^t^ clearly bounds the number of walks, and by equation ^ the 
contribution of each walk is bounded by Let £ > 41oglog(A^). Since t — 1 < 0.5R,R = 

log(iV), andpiV < 0(log^(iV)), we have (piV)*-i < log(iV)(4+°(i))(0-5 i°gW) = iog(iV)(2+o(i)) log(Af)^ 
N = log(iV)°(i°sW), < log(Ar)iogW, and / < (Ari-o(i))4iogiog{iV) ^ iQg(^)(-4+o(i)) iog(7V)_ 
Consequently B < log(iV)(-'^+2+i+«(i)) i°s(^) < 1. This concludes the treatment of the non-typical 
walks. 

Handling the 'typical' walks (small i). A new encoding is required to handle the typical walks. As 
before, the encoding includes the names of all t vertices, ordered by their first appearance, and all (+) and 
all non-critical (-) steps are simply encoded by and (-) and decoded trivially. Our new perspective 
is thinking of the entire walk as composed of sequences of internal steps separated by external steps. 
We first encode the external steps economically, and prove that this enables to economically encode the 
critical steps as well. Next, we handle the internal sequences. We first provide some general observations 
regarding arbitrary internal sequences. Then, we describe how to handle the specific case of encoding a 
closed internal sequence. Finally, we generalize the latter to encoding open internal sequences as well. 

8.2.1 Encoding external steps. 

To exploit the small number of external edges, we add the following to the code. 

• A list of all £ external edges ei, e^. 

• Each external step on Cj is encoded by (i, d), where the bit d specifies the direction in which is 
traversed. 

Thus, encoding a singe external step has only 2i = 0(log log(iV)) possible values. This improves upon 
Il20ll4ni where such steps are encoded by their end- vertex which might have 0(log(A^)) possible values. 

8.2.2 Encoding critical steps. 

Recall that a step s is critical in F, if s is taken from a vertex x, s.t. that x has d > 2 critical edges 
for s. Critical edges are internal edges ei, that touch x and have been traversed exactly once up 
to s. We will show that each Cj can be associated with a unique external edge e. This wiU enable us to 
encode s using e which has only 0(log log(A^)) possible values. Specifically, consider any critical edge 
Cj = {x, w} for s which is not the first edge leading to x in F. Consider the step Si where Cj is traversed 
for the first time in F. Since x had already appeared in F, then Si = {x ^ w). If we omit Cj from T, we 
partition T into 2 disjoint sub-trees: Ti which contains x and T2 which contains w. Since is critical 
then the first time we return to x (after Sj), is not via {w — > x). Thus, there must exist some external 
edge e (that connects T2 to Ti) that is traversed between Si and the first time we return to x. We call 
the first of these edges e the external criticality edge (ECE) of e^, and denote it by c(ej, s). Clearly, 
different Cj-s have distinct ECEs, and in addition, at step s all ECEs are well defined by previous steps in 
F. Consequently, the following encoding is un-ambiguous. 

• Let s be a critical step from x, with critical edges ei, e^, and external criticality edges c(ej, s). 
If s traverses the first edge that leads to x in F we encode s by (-). Otherwise, we encode s by the 
position of c(ej, s) in the list of external edges. 

Note that not-critical negative steps are encoded by (-) as before. This concludes the treatment of external 
and critical steps. We finally encode neutral internal steps. 
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8.2.3 Encoding an arbitrary sequence of internal steps. 

Let S = (si, Sq) be a 'maximal' sequence of internal steps. Here maximality means that (i) either the 
step previous to si was external or that si is the first step in the entire walk, and that (ii) either the next 
step after Sq is external, or that Sq is the last step in the entire walk (maximality does not mean that there 
are no longer internal sequences S' in T). We remark that, in general, some of the edges used by S may 
have been traversed before S started and some may be introduced by S for the first time. 

Let X be the starting vertex of S. Fix some vertex w ^ x visited by S and let u = u{w) denote 
the predecessor of tt; in 5" (so the first time we reach u; in S" is via (n w)). Clearly, after each time 
we step u ^ w, then the only way to return to u is by stepping w —>■ u (otherwise we get a cycle from 
u to itself in T). Thus, when we pass e for the j'th time during S we go forward {u w) when j 
is odd and go backward {w —>■ u) when j is even. We call this the forward-backward observation. 
Since the predecessor is uniquely determined by previous steps in F, we can encode backward steps very 
economically. 

• A neutral-backward step is (economically) encoded by {nh). 

• A neutral-forward step {u —>■ w) is (explicitly) encoded by (n/, w). 

Given this, we desire to demonstrate that many of the neutral steps in S go backward. We first handle the 
following simple case. 

8.2.3.1 Encoding a closed sequence. 

Assume S is closed, namely, the end vertex of Sq is the starting vertex, x, of si. We claim that at least 
half the neutral steps in S go backward. We actually prove the latter for every edge e in S. Let #/(e) and 
#6(e) resp. denote the number of forward and backward steps on an arbitrary edge e during S. Note that 
currently, not only neutral but also (-I-) and (-) steps are counted. We show that #/(e) = #6(e). Indeed, 
otherwise, by the forward-backward observation the last step on e was a forward step Sj = (u — > w), 
and clearly there exists a path from x to u in T. However, since S is closed there must exist another path 
in T from w to x that avoids stepping {w u) - a contradiction. Now, let #nf{e) and ^nb{e) count 
the number of neutral-forward and neutral-backward steps on e during S. By the above, there are at least 
2 steps on e. There are 3 cases: (i) If e was never used prior to S the first step is (+/), the second is 
(—6). The next steps (if any exist) come in pairs of (n/)(n6). (ii) If e was used at least twice prior to 
S, then all steps come in pairs of {nf){nb). (iii) If e was traversed exactly once prior to S, the first 2 
steps are {—f){nb), and all consequent steps (if any exist) come in pairs of {nf){nb). In cases (i),(ii) 
we get ^nf{e) = #nb{e) and in case (iii) j^nf{e) = i^nb{e) — 1. Anyway, at least half of the neutral 
steps in a closed-internal sequence can be encoded economically. This concludes our analysis for closed 
sequences. 

The problem is that for open sequences, S, it might hold that all m{S) neutral steps in S aie forward, 
and by the 1201 |4TI encoding-scheme these steps contribute a huge t'^^^') factor to the bound on the 
number of code words. To overcome this, we use the following. 

8.2.3.2 Encoding an open sequence. 

Let X ^ y denote the start-vertex and end-vertex of some open maximal internal sequence S. Clearly, 
in T there exists a unique path P = {x = xi X2 — > ... —>■ Xr = y)- AH the steps in S can be 
uniquely partitioned into 2 categories, (i) Steps that traverse P (either forward (xi — > Xj+i) or backward 
(xj+i Xi)). (ii) Entire sub-sequence S', where each S' starts and ends at some path-vertex Xi, but 
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never touch P at any other vertex other than Xj. Such S' is a closed internal sequence and is encoded as 
discussed in Section 8.2.3.1. We first modify the encoding simply s.t. path steps are exphcitly encoded 
as such. 

• Each positive-path step is encoded by {+p). 

• Each forward-neutral-path step is encoded by (npf). 

• Each backward-neutral-path step is encoded by (npb). 

• Each forward-negative-path step is encoded by (—pf) (if the step s is critical and has an external 
criticality edge, the index of this edge in the list of external edges is added to the encoding of s as 
in Section [8T2I1. 

• Each backward-negative-path step is encoded by (—pb). 

Clearly, if the entire path P is known, then the latter encoding suffices to decode any path-step, be- 
cause on a path there is a unique forward-step and unique backward-step from each vertex. The question 
is how to recover the path itself. First, the end vertex y of P is well defined by the encoding. Indeed, if 
S is immediately followed by an external step s, than the encoding of s determines y. Otherwise, y is 
the last vertex in F, which is the (already known) first vertex of F. To recover the remaining vertices in 
P, we call a vertex on P either old or new according to whether it appeared in F prior to S or not. If all 
vertices are old, since we know x and y, and since there is a unique path connecting x to y in T, then P 
is uniquely defined. Otherwise, some vertices in P are new. We claim that no new vertex is followed in 
P by an old vertex. Otherwise, the path P includes a step Xj — > Xj+i where Xi is new but Xj+i is old. 
This means that before S started there was a path in T from Xj+i to xi (at each point in the walk, there 
exists a unique sub-tree of T that spans all the vertices traversed so far). Thus, P closes a cycle in T from 
Xj+i to itself - a contradiction. Therefore, if there are any new vertices there exists a unique final old 
vertex x along P. If x is known, then the path from x to x is unique (since there is a unique path between 
any vertex-pair in T). In this case, the other part of P from x to y is also well defined, because it con- 
sists only of new vertices (recall that the order in which new vertices appear in F is explicitly encoded). 
This covers all possible cases. Note that actually, if all steps on P are (-1-) and (-), then they are already 
uniquely decodable as before. Thus, the only addition required for decoding path steps is: 

• Let S be an open internal sequence, with path P that contains at least a single new vertex and at 
least a single neutral step. Let x be the final old vertex in P. Then the symbol x is added to the 
encoding of the first neutral path step in S. 

The main benefit here is that instead of encoding the end-point of each forward neutral step, it suffices to 
encode once the entire 'direction' of the path (this approach is similar to the ll20l |4TI encoding of critical 
steps). 

Wrapping up. 

By all the above, the final encoding (including all aforementioned modifications) is 1:1 as desired. We 
currently fix any £ < 41oglog(A^) and bound the contribution Et/ of all (t,^)-walks to E. Specifically, 
we bound the contribution of various parts in our encoding to Ef^i. Each contribution introduces a new 
multiplicative term to Et/. As before, 

• Choosing the (list of ordered) t vertices to appear in F contributes iN)t = @{N^). 
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• The basic encoding of each step as some combination of positive/negative/neutral path/non-path 
forward/backward contributes (0(1))^. 

• The contribution of each single walk F is 

We now consider the critical and neutral steps. Recall m is the total number of neutral steps. Let m\ 
count the number of external steps. Let m2 count the neutral steps in closed internal sequences. Let ma 
count the number of open internal sequences S s.t. their path P = P{S) contains at least a single neutral 
step and at least a single new vertex. 

• Choosing the I external edges to appear in T contributes ) = 0{t^^). 

• By Section [8.2. 1 [ encoding the external steps contributes (2^)*"^ 

• By Section [8.2.2[ encoding the critical steps contributes at most {I + 

• By Section 8.2.3.1 encoding the neutral steps in closed internal sequences contributes at most 

^0.5m2 

• By Section 8.2.3.2 encoding the neutral steps on the paths of open internal sequences contributes 
at most 

Recall that to bound A we are about to take the i?'th root of E, and that we are willing to tolerate small 
G factors in the bound on A. Since there are only (log(A^))®*^^^ possible t,i,mi,m2,m3, and since 
(log{N))^^ R^ = 1 + o(l), then we may consider only the choice of t, i,mi,m2,m3 that maximizes 
the bound (on the contribution to E). In addition, we may (i) Ignore the (0(1))^ factor from encoding 
specific steps as a combination of {+, — , n, p, /, b}. (ii) Consider A^*~^ instead of (because Nr = 2). 
(iii) Ignore the t^^ factor from the choice of external edges (since < log(A^)Si°s'°s(^) = 2°(^)). (iv) 
Replace the {i + 1)*"^ term with (^)*. 

Combining all the remaining (un-ignored) terms yields the following expression 

Asmi + t < log{N), and p > Ar-i+°(i), then = 2-(i-°(i))^i°g(^)2+'°sWi°g(^) < 1. Next, 

consider any open internal sequence S counted in ms. For any such S (except possibly the last one), there 
exists a unique neutral-external step that terminates S, so < 0.5(m — mi) + 1. Thus, 0.5m2 + < 
0.5m + 1 = 0.5(i?-2(t-l)) + l. Thus,t < piV implies ^' < (piV)*-itO-5(^-2{*-i))+i < (piV)0-5«+i. 
All the above gives A < G(l)^'i/-^ = e{^/pN)- ■ 
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